Lambek vs. Lambek: Functorial Vector Space Semantics and String Diagrams for Lambek Calculus
نویسندگان
چکیده
The Distributional Compositional Categorical (DisCoCat) model is a mathematical framework that provides compositional semantics for meanings of natural language sentences. It consists of a computational procedure for constructing meanings of sentences, given their grammatical structure in terms of compositional type-logic, and given the empirically derived meanings of their words. For the particular case that the meaning of words is modelled within a distributional vector space model, its experimental predictions, derived from real large scale data, have outperformed other empirically validated methods that could build vectors for a full sentence. This success can be attributed to a conceptually motivated mathematical underpinning, something which the other methods lack, by integrating qualitative compositional type-logic and quantitative modelling of meaning within a category-theoretic mathematical framework. The type-logic used in the DisCoCat model is Lambek’s pregroup grammar. Pregroup types form a posetal compact closed category, which can be passed, in a functorial manner, on to the compact closed structure of vector spaces, linear maps and tensor product. The diagrammatic versions of the equational reasoning in compact closed categories can be interpreted as the flow of word meanings within sentences. Pregroups simplify Lambek’s previous type-logic, the Lambek calculus. The latter and its extensions have been extensively used to formalise and reason about various linguistic phenomena. Hence, the apparent reliance of the DisCoCat on pregroups has been seen as a shortcoming. This paper addresses this concern, by pointing out that one may as well realise a functorial passage from the original type-logic of Lambek, a monoidal bi-closed category, to vector spaces, or to any other model of meaning organised within a monoidal bi-closed category. The corresponding string diagram calculus, due to Baez and Stay, now depicts the flow of word meanings, and also reflects the structure of the parse trees of the Lambek calculus. 1 ar X iv :1 30 2. 03 93 v1 [ m at h. L O ] 2 F eb 2 01 3
منابع مشابه
Categorical Foundations for Extended Compositional Distributional Models of Meaning
Compositional distributional models of meaning were introduced by Coecke et al. (2010, 2013) with the aim of reconciling the theory of distributional meaning in terms of vector space semantics with the theory of compositional interpretation as one finds it in typelogical grammars. The particular typelogical formalisms employed by Coecke et al. (pregroup grammars, Lambek calculus) have a recogni...
متن کاملRelational Semantics for the Lambek-Grishin Calculus
We study ternary relational semantics for symmetric versions of the Lambek calculus with interaction principles due to Grishin (1983). We obtain completeness on the basis of a Henkin-style weak filter construction. 1 Background, motivation The categorial calculi proposed by Lambek and their current typelogical extensions respect an “intuitionistic” restriction: in a Gentzen presentation, Lambek...
متن کاملRelational Semantics of the Lambek Calculus Extended with Classical Propositional Logic
We show that the relational semantics of the Lambek calculus, both nonassociative and associative, is also sound and complete for its extension with classical propositional logic. Then, using filtrations, we obtain the finite model property for the nonassociative Lambek calculus extended with classical propositional logic.
متن کاملFree monoid completeness of the Lambek calculus allowing empty premises
We prove that the Lambek syntactic calculus allowing empty premises is complete with respect to the class of all free monoid models (i. e., the class of all string models, allowing the empty string).
متن کاملComplexity of the Lambek Calculus and Its Fragments
• The Lambek calculus (denoted L) is a mathematical tool for formal language specification. It generates the class of all context-free languages without the empty word. • The Lambek calculus with empty antecedents (denoted L∗) generates the class of all context-free languages. • Proof nets provide a convenient criterion for derivability in L∗. • The derivability problems for L∗(\, /) and L(\, /...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Ann. Pure Appl. Logic
دوره 164 شماره
صفحات -
تاریخ انتشار 2013